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Abstract — Flash memory is a non-volatile computer memory 
comprised of blocks of cells, wherein each cell is implemented 
as either NAND or NOR floating gate. NAND flash is currently 
the most widely used type of flash memory. In a NAND flash 
memory, every block of cells consists of numerous pages; rewrit- 
ing even a single page requires the whole block to be erased 
and reprogrammed. Block erasures determine both the longevity 
and the efficiency of a flash memory. Therefore, when data in 
a NAND flash memory are reorganized, minimizing the total 
number of block erasures required to achieve the desired data 
movement is an important goal. This leads to the flash data 
movement problem studied in this paper. We show that coding 
can significantly reduce the number of block erasures required 
for data movement, and present several optimal or nearly optimal 
data-movement algorithms based upon ideas from coding theory 
and combinatorics. In particular, we show that the sorting-based 
(non-coding) schemes require at least 0(n log n) erasures to move 
data among n blocks, whereas coding-based schemes require only 
0(n) erasures. Furthermore, coding-based schemes use only one 
auxiliary block, which is the best possible, and achieve a good bal- 
ance between the number of erasures in each of the n + 1 blocks. 

Index Terms — coding theory, combinatorics, data storage, flash 
memory, graph theory, wear leveling 



I. Introduction 

FLASH memory devices have become the most widely 
used non-volatile electronic memories. There are two ba- 
sic types of such devices: NAND and NOR flash memories [8]. 
Between them, NAND flash is currently used much more often 
due to its higher data density. In a NAND flash, floating-gate 
cells are organized into blocks. Each block is further parti- 
tioned into multiple pages, and every read or write operation 
accesses a page as a unit. Typically, a page has 2KB to 4KB of 
data, and 64 or 128 pages comprise a block [8]. Flash memo- 
ries have a unique block erasure property: although every page 
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can be read and written (for the first time) individually, rewrit- 
ing a page (that is, modifying its contents) requires the whole 
block to be erased and then reprogrammed. Typically, every 
block can endure 10 4 ~10 5 erasures, after which the flash 
memory no longer meets quality guarantees and may break 
down. Block erasures also degrade the quality of the cells, in- 
troduce errors in the data, and reduce the overall read and write 
performance. Therefore, it is critical to minimize the number 
of block erasures. It is also critical to balance the number 
of erasures across different blocks. For this reason, numerous 
wear leveling techniques are widely used in flash-memory sys- 
tems. The general idea is to balance erasures by migrating data 
to different locations, especially when data are rewritten J§). 

In wear leveling, it is often desirable to move the frequently 
changing data (so-called hot data) into the same blocks, while 
storing the mostly static data together in other blocks. Thereby 
the overall erasures caused by the hot data can be reduced 
(see HO, |9])- The specific locations to which the data are 
moved can be optimized not only based on the update frequen- 
cies, but also on the correlation among the data. Another im- 
portant application where data movement is required is defrag- 
mentation of files. Many file systems (and database systems) 
implemented in flash take the log-structured approach, wherein 
updates to files are stored non-consecutively across blocks. 
This way, wear leveling is achieved and local block erasures 
are avoided [5]. Consequently, files are frequently fragmented. 
To improve performance, data have to be moved periodically 
in order to reorganize the file segments. In database systems or 
sensors, after bursty incoming data flows are reliably stored, 
data movement is used to store the data in a categorized 
manner for efficient analysis. To facilitate data movement, 
a flash translation layer (FTL) is usually employed to map 
logical data pages to physical pages in the flash memory JBJ- 
Minimizing the number of block erasures incurred during the 
data movement process remains a major challenge. 

In this paper, we show that coding techniques can signifi- 
cantly reduce the number block erasures incurred during data 
movement. In addition to the overall number of erasures, we 
also consider other parameters, such as coding complexity and 
extra storage space (number of auxiliary blocks). We show that 
without coding, at least two auxiliary blocks are needed to 
enable data movement, and present a sorting-based solution 
that requires 0(n log n) block erasures in order to move data 
among n blocks. With coding, only one auxiliary block is 
needed. We present a very efficient data-movement algorithm 
based on coding over GF(2) that requires only 2n erasures. 
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We also present another coding-based algorithm that requires 
at most 2n — 1 erasures, which is worst-case optimal. Although 
minimizing the number of erasures for every instance of data 
movement is NP hard, as we prove in Section [V-BI both algo- 
rithms achieve an approximation ratio of 2 with respect to the 
minimum possible number of erasures for the given instance. 

We note that a number of papers on coding for flash mem- 
ories have recently appeared in the literarture. These include 
codes for efficient rewriting [3|, |7), iflOl . |[T5ll (also known as 
floating codes or flash codes), error-correcting codes [4 |, and 
rank-modulation codes for reliable cell programming fTD . iTOl . 
However, to the best of our knowledge, this paper is the first 
to address storage coding at the page level instead of the 
cell level. Furthermore, our topic of study, namely the data- 
movement problem, is also distinct from all previous works. 

The rest of the paper is organized as follows. In Section UT1 
we define the data movement problem and introduce the 
relevant notation. In Section [TTTJ sorting -based data movement 
algorithms are presented. We further show in Section [III] that 
coding can help minimize the auxiliary storage requirements 
during data movement. In Section |IV] we develop an efficient 
data-movement algorithm, based on coding over GF(2). This 
algorithm uses only 2n erasures in order to move data among 
n blocks. In Section [V] we present an alternative coding-based 
algorithm, which uses at most 2n — 1 erasures and is worst- 
case optimal. The NP hardness of the general problem of min- 
imizing the number of erasures in data movement (for every 
given instance) is also established in [V] Finally, Section [VI] 
contains our concluding remarks. 

II. Terms and Concepts 

In this section, we formally define the data movement problem, 
and present some useful concepts. 

Definition 1. (Data Movement Problem) 
Consider n blocks storing data in a flash memory, and suppose 
that each block contains m pages. The n blocks are denoted 
by £>i, . . . , B n , and the m pages in block Bj are denoted by 
Pi,lr ■ ■ ■ i Vi,m f or i = 1/ ■ • ■ / n - Let oc(i, j) and f) be two 
functions: 

a(i,j) : {!,..., n} x {l,...,m} -> {l,...,n}; 
: {l,...,n} x {l,...,m} -» {l,...,m}. 

The functions a(i, j) and j) specify the desired data move- 
ment. Specifically, the data initially stored in the page Pi t j are 
denoted by Djj, and need to be moved into page fa{i,i),p(i,i)> 
for all (i, j) G {1, . . . ,n} X {1, . . . , m}. 

A given number of empty blocks, called auxiliary blocks, can 
be used in the data movement process, and they need to be 
erased in the end. To ensure data integrity, at any moment of 
the data movement process, the data stored in the flash memory 
blocks should be sufficient for recovering all the original data. 
The objective is to minimize the total number of block erasures 
in the data movement process. 

Clearly, the functions oc(i, j) and j) together have to 
form a permutation for the mn pages. To avoid trivial cases, 
we assume that every block has at least one page whose data 



need to be moved to another block (otherwise, it can be simply 
excluded from the set of the n blocks considered in the data 
movement problem). Also note that a block has to be fully 
erased whenever any of its pages is modified. 

Let us now define some terms that are used throughout the 
paper. There are two useful graph representations for the data 
movement problem: the transition graph and a bipartite graph. 
In the transition graph G = (V,E), \ V\ = n vertices represent 
the n data blocks B\, . . . , B n . If y pages of data need to be 
moved from Bj to Bj, then there are y directed edges from 
Bj to Bj in G. G is a regular directed graph with m outgoing 
edges and m incoming edges for every vertex. In the bipartite 
graph H = (V\ U V%, E'), V\ and ~V% each has n vertices that 
represent the n blocks. If y pages of data are moved from 
Bj to Bj, there are y directed edges from vertex B, 6 V\ to 
vertex Bj 6 Vi- The two graphs are equivalent but are used 
in different proofs. 

Definition 2. 

(Block-Permutation Set and Semi-Cycle) 
A set of n pages {p-i / j 1 , Vi,j v ■ ■ ■ , Pn,j„} that belong to n dif- 
ferent blocks is called a block-permutation set if 

{a{l,j l ),oc{2,i2),...,a{n ) j n )} = {1,2,... ,n}. 

■h> Vlfv ■■■ ' Vn,]„} is a block-permutation set, then the 
data they initially store, namely {D\ j 1 , E>2,/ 2 , • ■ ■ , D n j n }, are 
called a block-permutation data set. 

Let z £ {1, 2, . . . , n}. An ordered set of pages 

(Vio,jo> Ph,ji' ■ ■ ■ ' Pi'z-Wz-i) 
is called a semi-cycle if for k = 0, 1, . . . , Z — 1, we have 

a (h> ]k) — h+1 modz- 

Example 1. The data movement problem shown in Fig.Q] 
exemplifies the construction of the transition graph and the 
bipartite graph. The nm = 18 pages can be partitioned into 
three block-permutation sets: 

{Pl,l, Pl,2, P3,2, Vi,2, P5,3, P6,l}, 
{P\,2, V2,l, P3,3, Vi,3, P5,2, P6,2}, 
{Pl,3, P2,3, P3,l, Pi,l, P5,l, P6,3}- 

The block-permutation sets can be further decomposed into 
six semi-cycles: 

(P5,3, P\,\, P3,2, P6,l), 
(P2,2, Pi,2); 
(P5,2, P3.3, Pl,2, P2,\, P4,3, Pb,2)) 
i.V\,3), 
{P2,3, V3,\, P4,l), 
(P5,l, Pb,3)- 

□ 

Every semi-cycle corresponds to a directed cycle in the 
transition graph, and every block-permutation set corresponds 
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Fig. 1. Data movement with n — 6,m = 3. (a) The permutation table. The numbers with coordinates are a(i, /'), /'). For example, 

(ct(l,l), /3(1, 1)) = (3,3), and (flc(l,2),/5(l,2)) = (2,1). (b) The transition graph, (c) The bipartite graph representation. The n thick edges are a perfect 
matching (a block-permutation set), (d) After removing a perfect matching from the bipartite graph. Here for i = I, . . , ,n, vertex i represents block B;. 



to a set of directed cycles that enter and leave every vertex 
exactly once. It is not a coincidence that the nm pages in the 
above example can be partitioned into m block-permutation 
sets. The following theorem shows it holds for the general 
case. 

Theorem 1. The nm pages can be partitioned into m block- 
permutation sets. Therefore, the nm pages of data can be 
partitioned into m block-permutation data sets. 

Proof: The data movement problem can be represented 
by the bipartite graph, where every edge represents a page 
whose data need to be moved into another block. (See Fig. Q] 
(c) for an example.) It is known that for every bipartite graph 
G = (V,E) with bipartition {A,B} (namely, A n B = and 
A U B = V), we have the Hall's Marriage Theorem ||6): 
The graph G contains a matching of A if and only if 
\N{S)\ > \S\ for all S C A. (Here N(S) is the set 
of vertices in the graph G that are adjacent to at least 
one vertex in S. That is, the vertices in N(S) are 
the neighbors of the vertices in S.) 
For the bipartite graph we are considering here, for i = 
1, . . . , n, any i vertices in the top layer have im outgoing edges 
and therefore are connected to at least i vertices in the bottom 
layer. Therefore, the bipartite graph has a perfect matching. 
The edges of the perfect matching correspond to a block- 
permutation set. If we remove those edges, we get a bipartite 
graph of degree m — 1 for every vertex. (See Fig. Q] (c), (d).) 
With the same argument, we can find another perfect matching 
and reduce the bipartite graph to regular degree m — 2. In this 
way, we partition the nm edges into m block-permutation sets. 

■ 

A perfect matching in the bipartite graph can be found using 
the Ford-Fulkerson Algorithm IPT41 for computing maximum 
flow. The idea is to connect all the n top-layer vertices of the 
bipartite graph to a source s and connect all the n bottom-layer 
vertices to a sink f. Then a perfect matching in the bipartite 
graph is equivalent to a maximum flow of capacity n between 



the source s and the sink t. The Ford-Fulkerson Algorithm has 
time complexity 0(n 2 m), so decomposing the nm edges in the 
bipartite graph into m perfect matchings has time complexity 
0(n 2 m 2 ). Therefore, we can partition the nm pages into m 
block-permutation sets in time 0(n 2 m 2 ). 



III. Coding for Minimizing Auxiliary Blocks 

In this paper, we focus on the scenario where as few auxiliary 
blocks as possible are used in the data movement process. In 
this section, we show that coding techniques can minimize the 
number of auxiliary blocks. Afterwards, we will study how to 
use coding to minimize block erasures. 

A. Data Movement without Coding 

When coding is not used, data are directly copied from page 
to page. The following simple example shows that, in the 
worst case, more than one auxiliary block is needed for data 
movement. Note that Dj s denotes the data initially stored in 
the page p f/ y. 

Example 2. Let n = m = 2, and let the functions a(i, j) and 
jS(i,;) be: 

(«(1,1),0(1,1))=(1,1), 

(«(1,2), 0(1,2)) = (2,2), 

(«(2,1),/3(2,1))= (2,1), 

(«(2,2), 0(2,2)) = (1,2). 

It is simple to verify that without coding, there is no way 
to move the data as requested with only one auxiliary block. 
To see that, assume that only one auxiliary block Bq is used. 
Without loss of generality, assume that we first copy the data 
in £>i - the data Dj \ and D\ 2 - into Bo, an d Lnen erase B\. In 
the next step, the only reasonable choice is to write into B\ the 
data Dj 1 and D2,2 (which are the data we want to eventually 
move into B{). After this writing, Bq has Dj 1 and Dj %, B\ 
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has D\ i and D2 2, and B2 has D24 and D2 2- The objective 
of the data movement has not been met yet. However, we can 
see that there is no way to proceed: in the next step, if we 
erase Bo, the data D\ 2 will be lost; if we erase £>2, the data 
D24 will be lost. So the data movement fails. It is simple to 
verify that no feasible solution exists. Therefore, at least two 
auxiliary blocks are needed. □ 

We now show that two auxiliary blocks are sufficient for 
data movement without coding. The next algorithm uses two 
auxiliary blocks, which are denoted by Bo and B Q . It operates 
in a way similar to bubble sort. And it sorts the data of the m 
block-permutation data sets in parallel. 

Algorithm 1. (Bubble-sort-based Data Movement) 
Decompose the nm pages of data into m block- 
permutation data sets. 

Fori = 1, . . . , n — 1 
{ 

For j = i + 1, . . . , n 
{ 

Copy all the data of B, into Bq; 
Copy all the data ofBj into B' ; 
Erase B, and Bj; 
Fork = 1, . . . , m 
{ 

Let Dj 1/ j 1 and Dj 2/ j 2 be the two pages of data in Bq 
and Bq, respectively, that belong to the k-th block- 
permutation data set. 

Let pj t j 3 be the unique page in Bj such that when the 
data movement process ends, the data stored in p;j 3 
will be from the k-th block-permutation data set. 

If oc(\.2, 72) = ' (which implies fi(i2, ji) = 73 an d 
ji) 7^ i), copy the data Dj 2 j 2 into the page Pi t j 3 ; 
otherwise, copy the data Dj 1 t x into the page Vi,] y 

} 

Write into Bj the m pages of data that are in Bq or B Q 
but not in Bj. 
Erase Bq and B Q . 

} 

} 

In the above algorithm, for every block-permutation data 
set, its data are not only sorted in parallel with other block- 
permutation data sets, but are also always dispersed in n 
blocks (with every block holding one page of its data). The 
algorithm uses 0(n 2 ) erasures. (The n blocks B\, . . . , B n 
are each erased n — 1 times, while the two auxiliary blocks 
Bq and B Q are each erased (2) times.) If instead of bubble 
sorting, we use more efficient sorting networks such as the 
Batcher sorting network J2) or the AKS network JT], the 
number of erasures can be further reduced to 0(n log 2 n) and 
O(nlogft), respectively. For simplicity we skip the details. 



B. Storage Coding with One Auxiliary Block 

In Algorithm [TJ the only function of the two auxiliary blocks 
Bo and Bq is to store the data of the data blocks B\, By when the 
data in Bj, Bj are being swapped. We now show how coding 
can help reduce the number of auxiliary blocks to one, which 
is clearly the best possible. Let Bq denote the only auxiliary 
block, and let poj, Po,2/ • • • / Po,m denote its pages. For k = 
1, . . . , m, statically store in page po,jc me bit-wise exclusive- 
OR of the n pages of data in the k-th block-permutation data 
set. We make such a change in Algorithm [TJ 

When the data in Bj, Bj are swapped, instead of erasing 
them together, we first erase Bj and write data into Bj, then 
erase Bj and write data into Bj. 

This is feasible because for every block-permutation data 
set, there are always at least n pages of data related to it 
stored in the n + 1 blocks: n — 1 pages of those data are 
the original data in the block-permutation data set, and the 
other page of data are the bit-wise exclusive-OR of the data 
of the block-permutation data set. The total number of block 
erasures here is of the same order as the algorithm without 
coding. Therefore, if the AKS network is used for swapping 
the data, O(nlogn) block erasures will be used in total. 



IV. Efficient Storage Coding over GF(2) 

In this section, we present a data movement algorithm that uses 
only one auxiliary block and In erasures. It erases every block 
either once or twice, which is well balanced. The algorithm 
uses coding over GF(2) and is very efficient. 

For convenience, let us assume for now that every block has 
only one page. The results will be naturally extended to the 
general case where every block has m pages. Let Bo denote the 
auxiliary block, and let po denote its page. For i = 1, . . . ,n, 
let pj denote the page in the block Bj, and let D, denote the 
data initially stored in the page p,-. Let 

oc: {l,...,n} -> {l,...,n} 

be the permutation such that the data D, need to be moved 
into the page p a ^y Let aT 1 be the inverse permutation of a. 
Say that the n pages p\, pi, . . . , p n can be partitioned into t 
semi-cycles, denoted by 

C\, C2, ■ ■ ■ , Cf. 

Every semi-cycle Q (1 < i < f) has a special page called 
tail, defined as follows: if pj is the tail of Q, then for every 
other page G Q, we have j > k. 

We use "©" to represent the bit-wise exclusive-OR of data. 
The following algorithm uses 2n block erasures to move 
data. It consists of two passes: the forward pass and the 
backward pass. Note that in the algorithm below, whenever 
some data are about to be written into a page, the data can 
be efficiently computed from the existing data in the flash 
memory blocks (namely, from the data currently stored in the 
flash memory). The details will be clear later. Also note that 
for i — \,2, . . . ,n, D a -i(n is the data that need to be moved 
into the page p,. 



Jiang, Mateescu, Yaakobi, Bruck, Siegel, Vardy, and Wolf: STORAGE CODING FOR WEAR LEVELING IN FLASH MEMORIES 



5 



Bp | B x | B 2 | B 3 | B 4 \B 5 \ B 6 \B 7 \B g \\ Operation" 



forward pass 





1 


2 


3 


4 


5 


6 


7 


8 


Si ©6 4 


1 © 4 




2 


3 


4 


5 


6 


7 


8 


5 2 ©5 5 


104 


2©5 




3 


4 


5 


6 


7 


8 


<5 3 ® <5o ® <$4 


1©4 


2©5 


301 




4 


5 


6 


7 


8 




1®4 


2©5 


301 


4® 7 




5 


6 


7 


8 




104 


2©5 


301 


4® 7 


5©6 




6 


7 


8 


copy <5 6 


104 


2©5 


301 


4® 7 


5®6 


6 




7 


8 


57 0^8 


1©4 


2©5 


301 


4® 7 


5®6 


6 


7®8 




8 


copy S s 


104 


2®5 


301 


4® 7 


5®6 


6 


7®8 


8 






backward pass 


1©4 


2©5 


3®1 


4©7 


5®6 


6 


7 © 8 


8 




5 7 © <5 6 © <5 3 © <5 © <5 2 


104 


2©5 


301 


4® 7 


5®6 


6 


7©8 




3 


<5 6 © <5 3 © <5 © <5 2 © <5 8 


104 


2®5 


301 


4©7 


5®6 


6 




8 


3 


<5s © <*4 © "5i 


1©4 


2®5 


301 


4©7 


5®6 




2 


8 


3 


<5 4 © ^ © «5 6 


104 


2©5 


301 


4® 7 




6 


2 


8 


3 


<5 3 © ^0 © <52 © <5 8 


1©4 


2®5 


301 




7 


6 


2 


8 


3 


<5 2 ©,5 8 


1©4 


2®5 




1 


7 


6 


2 


8 


3 


Si ©<5 6 


1©4 




5 


1 


7 


6 


2 


8 


3 


S ® S 3 




4 


5 


1 


7 


6 


2 


8 


3 





Fig. 2. Example execution of Algorithm [2] In the rightmost column, <5; 
denotes the data in the page p, at that moment, for i = 0,1, ... ,8. 



Algorithm 2. (GF(2)-coding-based data movement) 
Forward Pass: 
Fori = 1,2, ... ,n do: 
If pi is not the tail of its semi-cycle, write 

D,-©D a -i (0 

into the page p;_i; otherwise, write 

into the page Then, erase the block B, . 

Backward Pass: 
For i = n, n — 1, . . . , 1 do: 
Write 

D «-i(0 

into the page p t . Then, erase the block B;_i . 

Example 3. Figure [2] gives an example of the execution of 
Algorithm |2] with n — 8 and t = 2. Here 

(flt(l) / a(2) / ... / «(8)) = (3,6,8,1,2,5,4,7). 

Consequently, we have 

(a- 1 (l),«- 1 (2),...,«- 1 (8)) = (4,5,1,7,6,2,8,3). 

The two semi-cycles are (pi, p 3 , pg, pj, p£) and {jpi, pd, p$). 
In Figure [2] each row is a step of Algorithm [2] The numbers 
are the data in the blocks. (For convenience, we use i to denote 
data Dj in the figure for i — 1, 2, . . ., 8.) The rightmost column 
describes the computation performed for this step, where <5, 
denotes the data in p; then. □ 

The correctness of Algorithm|2]depends on whether the data 
written into a page can always be derived from the existing 
data in the flash memory blocks. Theorem|2] shows this is true. 

Theorem 2. When Algorithm [2] is running, at any moment, 
for i = 1,2, ... ,11, if the data Dj are not stored in the n + 1 
blocks Bq, B\, . . . , B n , then there must exist a set of data 

{Di © D h , D h © D j2 , D h © D /s , . . . , D jk l © D k , D k } 



that are all stored in the n + 1 blocks. Therefore, D; can be 
easily obtained by computing the bit-wise exclusive-OR of the 
data in the set. 

Proof: Consider a semi-cycle Q (1 < I < t), and let us 
denote its pages by 

Vh-Viv ■ ■ ■ >Vif 
Without loss of generality (WLOG), assume 

*(*/) = 
for j = 1,2, . . . , x — 1 and 

ocii x ) = i\- 

Also assume that p, is the "tail" of the semi-cycle, namely, 
z'l > ij for j — 2, 3, . . . , X. Now imagine a directed path S as 
follows: 

1) S has x vertices, representing the data D; r D, 2 , . . ., D, x ; 

2) There is a directed edge from Dj. to D,. +1 for j = 
1,2, . . . , x — 1. The edge represents the data 

For example, the data movement problem in Example [3] 
has two semi-cycles, (p 1 , p 3 , p s , p 7 , p 4 ) and (p 2 , p 6 , ps)- We 
show the corresponding directed path S in Fig. [3] (a) and (b). 

Consider the forward pass in the algorithm. In this pass, 
for j = 2,3, ... ,x, right before the data are erased, the 
data Dj. _ 1 © D;^. are stored. Note that D,^ corresponds to a 
vertex in the directed path S, and D;._j © D,. corresponds to 
the directed edge entering that vertex in S. So for every vertex 
in S whose corresponding data have been erased, there is a 
directed sub-path in S entering it with this property: "the data 
represented by the edges in this sub-path, as well as the data 
represented by the starting vertex of the sub-path, are all stored 
in the n + 1 blocks." This is the same as the condition stated 
in the theorem. (For instance, for the data movement problem 
in Example [3] after three block erasures, the stored and un- 
stored data are as shown in Fig. 0(c). As an example, consider 
the erased data D3. The corresponding sub-path entering it 
contains the data D4, D4 © D1 and D\ © D3, which are stored 
and can be used to recover D3.) 

When the forward pass of the algorithm ends, the data 
represented by the vertex D,- and all the edges in S are all 
stored in the n + 1 blocks. Clearly, all the original data can 
be recovered. 

Now consider the backward pass in the algorithm. In this 
pass, first, the data Dj x are stored and then the data D ;i are 
erased. Then, for j = 1,2, . . . ,x — 1, right before the data 
Dj . © Dj . , are erased, the data D; . are stored. Note that Dv . 
corresponds to a vertex in the directed path S, and Dj. © 
corresponds to the directed edge leaving that vertex in S. So 
for every vertex in S whose corresponding data have been 
erased, there is a directed sub-path in S leaving it with this 
property: "the data represented by the edges in this sub-path, 
as well as the data represented by the end vertex of the sub- 
path, are all stored in the n + 1 blocks." This is the same as 
the condition stated in the theorem. So the conclusion holds. 
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(a) 



D 4 © Dj 



D, © D 4 




D 8 © D 7 



(b) 



D, © D 




D 4 © D 



D y © D 4 




"l^D. © D, 



©- G) 

D 8 © D y 



D, © D 




D fi © O 



Fig. 3. The directed path S corresponding to a semi-cycle, whose vertices and edges represent data, (a) The directed path S for semi-cycle [pi, p$, pg, p-j, p$). 
(b) The directed path S for semi-cycle (pi, P6, Ps)- (c) The stored and un-stored data after three block erasures in the "forward-pass" of the data-movement 
algorithm. The vertices and edges of solid thick lines represent the data that are stored at that moment. The vertices and edges of dashed thin lines represent 
the data that are not stored at that moment. 



Algorithm [2] can be easily extended to the general case 
where every block has m > 1 pages. Use the algorithm to 
process the m block-permutation data sets in parallel, in the 
same way as Algorithm Q] Specifically, for i = 1, . . . , n and 
j = 1, . . . ,m, let Pi r k(i,j) denote the unique page in B, such 
that some data in the /-th block-permutation data set need 
to be moved into pgt^y). In the algorithm, every time B z - is 
erased, write the data related to the j-th block-permutation data 
set into p ; - ^ :y Since every block-permutation set occupies 
exactly one page in each block, there will be no conflict in 
writing. 

V. Storage Coding with Minimized Number of 
Erasures 

In this section, we present an algorithm that uses at most 2n — 
1 erasures, which is worst-case optimal. It erases every block 
either once or twice, which is well balanced. We further show 
that it is NP hard to minimize the number of erasures for every 
given instance, but our algorithm provides a 2-approximation. 

A. Optimal Solution with Canonical Labelling 

The n blocks initially storing data can be labelled by 
Bi,...,B n in n\ different ways. Let y be an integer in 
{0,1,..., n — 2}. We call a labelling of the n blocks that 
satisfies the following constraint a canonical labelling with 
parameter y: 

"For i = y + 1, y + 2, . . . , n — 1 and j = i + 2, i + 3, . . . , n, 
no data initially stored in the block Bj need to be moved into 
the block B;." 

Trivially, any labelling is a canonical labelling with param- 
eter n — 2. However, given an instance of the data movement 
problem, it is difficult to find a canonical labelling that 
minimizes the value of y. 

We now present a data-movement algorithm for blocks 
that have a canonical labelling with parameter y. It uses one 
auxiliary block Bg, and uses 



y- 



K2n-\ 



erasures. (So the smaller y is, the better.) For convenience, let 
us again assume that every block contains only one page, and 
let p,-, Dj, a, a^ 1 be as defined in the previous section. Let r 
denote the number of bits in a page. Q| The algorithm can be 

1 When r is greater than what is needed by Algorithm fj] (which is nearly 
always true in practice), we can partition each page into bit strings of an 
appropriate length, and apply the algorithm to the strings in parallel. 



naturally extended to the general case, where every block has 
m > 1 pages, in the same way as introduced in the previous 
section. 

Algorithm 3. (Data Movement with Linear Coding) 
This algorithm is for blocks that have a canonical labelling with 
parameter y 6 {0, 1, — 2}. Lety\, yi, ■ ■ ■ , Yn be distinct 
non-zero elements in the field GF(2 r ). 

Step 1 : For i = 0, 1, . . . , y do: Erase B/ (for i = there is 
no need to erase Bq), and write into pj the data Yiflk- 

Step 2: For i — y + 1, y + 2, . . . , n do: Erase Bj, and write 
into pj the data D^-i/^. 

Step 3: For i — y, y — 1, . . ., 1 do: Erase B/, and write into 
the page pi the data D^-tw. Finally, erase Bq. 

Theorem 3. Algorithm^ is correct and uses 



y- 



K2n 



erasures. (Note that the algorithm assumes that the blocks 
have a canonical labelling with parameter y.) 

Proof: We show that each time a block B, is erased, it 
is feasible to generate all the n pages of original data using 
the current data stored in the other n pages. Denote by 5j, 
< i < n, the current data stored in the page p,-, which 
are a linear combination of the n pages of original data. The 
linear combination written in each page can be represented by 
a matrix multiplication 

H-{D lr D 2 ,...,D n ) T = (6 0r ... / 6 i _ 1 ,5 i+1/ ...,5 n ) T . 

The matrix H defines the linear combination of the original 
data written into each page. Consider the first step of the 
algorithm when the block B, is erased. The data written in 
Ph, for < h < i — 1, are 

h = I r[ J D fc , 
k=l 

and the data stored in p/,, for i + 1 < h < n, are 
S h = D h . 

The matrix representation of this problem is 



/ 1 



1 



71 72 
2 2 

n 72 



Yn 



D 2 
D 3 



y-l 
T n 



\ °(n-i)> 



Dn-1 



V 6n J 
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where 



(n-i)xi 



is the zero matrix of size (n — i) x z, and I n _ 



is the unit matrix of size (ft — z) x (ft — i). Since this matrix 
is invertible, it is feasible to generate all the original data and 
in particular, the required data that need to be written into p/. 

For i = y + 1, y + 2, . . . , n, after erasing the block B, 
during the second step of the algorithm, the data stored in 
Ph, for < /; < y, are 6 h = Lj? =1 y[ ! Dj-. The data written 
into p/j, for y + 1 < h < i — 1, are <5;, 
data stored in p/,, for i + 1 < h < ft, are 6;, 
equations are represented as follows: 



D„-i/ ft ), and the 
Di,. These 



/ 1 1 

Tl 72 
2 2 



y y 



1 \ 

7h 
7^ 



7;; 



Di 
D 2 
D 3 



&i-i 



is a matrix of size (n — y - 



1) x n defined as 



V 

where A„_ 
follows: 

1) The h-th row of the matrix A„_, for 1 < h < i — y — 
1 is a unit vector of length n containing a one in its 
(a" 1 (y + h))-th entry. 

2) The h-th row of the matrix A„_; for i — y < h < 
n — y — 1 is a unit vector that contains a one in its 
(y + h + l)-th entry. 

Since there are no data that are moved from block Bj to block 
B;, where y+l<i<n — 2 and i + 2 < j < n, the first 
i — y — 1 row vectors of the matrix A„_; are different from 
the last n — i row vectors of the matrix A„_,'. Therefore, the 
matrix A„_, contains a set of unit vectors where all the vectors 
are different from each other. If we calculate the determinant 
of the matrix on the left hand side according to the rows of 
the matrix A„_,, then we are left with an (y + 1) x (y + 1) 
matrix of the form: 



/111 

T n Yi 2 7i 3 

2 2 2 
'l 1 Hi r '3 



VUj Viy+1 
2 2 



y y 



and its determinant is not zero because it is a Vandermonde 
matrix. Therefore, the matrix on the left hand side is invertible, 
and it is feasible to generate all the original data D,, 1 < i < 
n, and in particular the data D a -i^ that need to be written 
into the page p,. 

For i = y, y — 1, . . . , 1, after erasing the block B, during 
the third step of the algorithm, the data stored in p;,, for < 
h < i — 1, are 5f, = E"=i Tk^h an d me data stored in p/,, 



for i ' + 1 < h < n, are <5;, = D 
representing this equations is 



a-!(/0 



Therefore, the matrix 



/ 1 

7l 

rl 



l 

72 

rl 



1 \ 

Yn 

Yl 



Yn 



Di 
D 2 
D 3 



D„-! 

\ D n ) 



\ Sn J 



where P„_,- is a matrix consisting of n — i row vectors of 
length n, and its h-th row vector, 1 < h < n — i, is a unit 
vector of length n which has a one in its a -1 (z + /i)-th entry 
and zero elsewhere. As before, all the unit vectors in the matrix 
P n -i are different from each other. Therefore the matrix on the 
left hand side is invertible, and it is feasible to generate all the 
original data D ( , 1 < i < n, and in particular the data D a -i^ 
that need to be written into the page p,. ■ 
The following theorem shows an interesting property of 
canonical labelling. Note that since every block has some 
data that need to be moved into it from some other block, 
every block needs to be erased at least once. So at least n + 1 
erasures (including erasing the auxiliary block) are needed in 
any case. 

Theorem 4. Assume r - the number of bits in a page - is 
sufficiently large. Let y £ {0, 1, • • • , n — 2}. There is a data- 
movement solution using 

n + y + 1 

erasures if and only if there is a canonical labelling of the 
blocks with parameter y. 

Proof: First, assume that there is a data-movement solu- 
tion using n + y + 1 erasures. Since every block (including the 
auxiliary block) is erased at least once, there are at least n — y 
blocks that are erased only once in the solution. Pick n — y 
blocks erased only once and label them as By + i, By + 2/ ■ • ■ / B H 
this way: "in the solution, when y + 1 < i < j < n, Bj 
is erased before B;." Label the other y blocks as B\,...,By 
arbitrarily. Let us use contradiction to prove that no data in Bj 
need to be moved into B,, where i > y + 1, j > i + 2. 

Assume some data in Bj need to be moved into B,-. After 
B ( - is erased, those data must be written into B,- because B ( - 
is erased only once. When the solution erases B 1+ i (which 
happens before By is erased), the data mentioned above exist 
in both Bj and Bj. So at this moment, there are at most nm — 1 
pages of distinct data; however, it is impossible to recover all 
the nm pages of original data using only nm — 1 pages of 
distinct data. So there is a contradiction. Therefore, with the 
above labelling, we have already found a canonical labelling 
with parameter y. The other direction of the proof comes from 
the existence of Algorithm [3] ■ 

We can easily make Algorithm [3] use 2n — 1 erasures by 
letting y = n — 2 and using an arbitrary block labelling. On 
the other hand, 2n — 1 erasures are necessary in the worst 
case. To see that, consider an instance where m > n and 
every block has some data that need to be moved into every 
other block. For such an instance, a canonical labelling has to 
have y = n — 2, which implies n + y + 1 = 2n — 1 erasures 
by Theorem HI So Algorithm [3] is worst-case optimal. 

B. Optimization for Each Given Instance 

A specific instance of the data movement problem may require 
less than 2n — 1 erasures. So it is interesting to find an 
algorithm that minimizes the number of erasures for each given 
instance. The following theorem shows that this is NP hard. 

Theorem 5. For the data movement problem, it is NP hard to 
minimize the number of erasures for every given instance. 
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Fig. 4. NP hardness of the data movement problem, (a) A simple undirected graph Go- (b) The corresponding regular directed graph G'. Here every edge 
between two different vertices has arrows on both sides, representing the two directed edges of opposite directions between those two vertices. There is a 
symbol X £ beside every directed loop, representing i parallel loops of that vertex. 



Proof: It has been shown in Theorem H] and its proof 
that minimizing the number of erasures is as hard as finding a 
canonical labelling for the blocks with a minimized parameter 
y. So we just need to show that finding a canonical labelling 
with minimized y is NP hard. We prove it by a reduction from 
the NP hard Maximum Independent Set problem. 

Let Go = (Vq,Eq) be any simple undirected graph. Let 
d{v) denote the degree of vertex v £ Vq and let A = 
maXj, e y d(v) denote the maximum degree of Go- We build 
a regular directed graph G' = (Vj U Vi U V3, E') as follows. 
Let I V | = |^l = ^2 = |^3 1- For all v £ Vq, there are 
three corresponding vertices v\ £ V\,X>i £ 1^2/^3 S V3. If 
there is an undirected edge between u, v £ Vq in Go, then 
there are two directed edges of opposite directions between w, 
and V; for i = 1, 2, 3 and j = 1, 2, 3. For all v £ Vq, there are 
also two directed edges of opposite directions between V\,vi 
and between Vi, v 3 . Add some loops to the vertices in G' 
to make all vertices have the same out-degree and in-degree 
3A + 2. See Fig. [4] for an example. 

The graph G' naturally corresponds to a data movement 
problem with n = 3\Vq\ and m = 3 A + 2, where G' is its 
transition graph. (The transition graph is defined in Sectionllll) 
Finding a canonical block labelling with minimized parameter 
y for this data movement problem is equivalent to finding 
t = n — y vertices - with the value of t maximized - in G', 

a 1 ,a 2 ,...,a t , 

such that for i = 1, 2, . . . , f — 2 and = i + 2, i + 3, . . . , f, 
there is no directed edge from aj to a,-. We call such a set 
of t vertices - with t maximized - the MAXIMUM Semi- 
INDEPENDENT Set of G'. For all v £ V , let N(v) denote 
the neighbors of v in Gq. 

Claim 1: "There is a maximum semi-independent set of 
G' where V v £ Vq, either all three corresponding vertices 
V\ £ V\,Vi £ Vi,v^ £ V3 are in the set, or none of them is 
in the set. What is more, if V\,Vi,v 3 are m the set , then no 
vertex in {w\, Wi, w 3 \w £ N(v)} is in the set." 

To prove Claim 1, let (a\,ai, . . . ,ai) denote a maximum 
semi-independent set (MSS) of G'. (Note that the order of the 
vertices in the set matters.) Consider two cases: 



Case 1: One of {v lr v 2 , v 3 } is in the MSS of G'. WLOG, 
say it is v\. At most two vertices - say b and c - in 
{w\, W2, w^\w £ N(v)} can be in the MSS, because oth- 
erwise due to the bi-directional edges between them and V\, 
there would be no way to place them in the MSS. Let us 
remove b, c from the MSS and add v^, v 3 right after V\ in the 
MSS. It is simple to see that we get another MSS. 

Case 2: Two of {v lr v 2 , v 3 } are in the MSS of G' . WLOG, 
say they are v\ and v^. At most one vertex - say b - in 
\w\,Wi,w 3 \w £ N(v)} can be in the MSS, for a similar 
reason as Case 1. In the MSS, let us remove b, move v-i right 
behind V\, and add c 3 right behind v-i- Again, we get an MSS. 

So in this way, we can easily convert any MSS into an MSS 
satisfying the conditions in Claim 1. So Claim 1 is true. 

Claim 2: "A set of vertices {w(l),w{2), . . . ,w{k)} is 
a maximum independent set of Go if and only if the set 
of vertices (w{l)\, w{l)2,w{l)^,wil)\,w{2)2, w{2)^, .. ., 
w(k)i,w(k)2,w(k)o,) is an MSS of G'." It is simple to see 
that this is a consequence of Claim 1. 

So given a canonical labelling with minimized parameter y 
for the data movement problem with G' as the transition graph, 
in polynomial time we can convert it into an MSS of G', from 
that into an MSS of G' satisfying the conditions of Claim 1, 
and finally into a maximum independent set of G. So it is NP 
hard to find a canonical labelling with minimized parameter 
y. So minimizing the number of erasures is NP hard. ■ 

Therefore, there is no polynomial time data-movement al- 
gorithm that minimizes the number of erasures for every given 
instance unless P = NP. However, since every algorithm uses 
at least n + 1 erasures, and Algorithm [3] can easily achieve 
2n — 1 erasures (by setting y = n — 2), we see that the 
algorithm is a 2-approximation algorithm. 



VI. Conclusions 

In this paper, we study the data movement problem for NAND 
flash memories. We present sorting-based algorithms that do 
not utilize coding, which can use as few as O(ftlogft) era- 
sures for moving data among n blocks. We show that coding 
techniques can not only minimize the number of auxiliary 
blocks, but also reduce the number of erasures to 0(n). In 
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particular, we present a solution based on coding over GF(2) 
that requires only In erasures. We further present a linear- 
coding solution that requires at most 2n — 1 erasures, which 
is worst-case optimal. Both solutions based on coding achieve 
an approximation ratio of two with respect to the minimum 
possible number of block erasures for each instance. They also 
balance the number of erasures in different blocks very well. 

The data movement problem studied here can have numer- 
ous practical variations. In one variation, the data to be moved 
into each block are specified, but the order of the data in that 
block is allowed to be arbitrary. The algorithms presented in 
this paper can easily solve this variation of the problem by first 
assigning an arbitrary page order to each block (which does not 
affect the performance of the algorithms). In another variation, 
we are only given a specification as to which group of data 
needs to be moved into the same block, without specifying 
which block. Furthermore, the final data may be a function of 
the data originally stored in the blocks. Such variations require 
new solutions for optimal performance. They remain open for 
future research. 
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